Unsupervised Machine Learning With 
Python: Clustering. Mean Shift Algorithm 


It is yet another well-liked and effective clustering method applied in unsupervised learning. It is a 
non-parametric method because it makes no assumptions. It is also known as mean shift cluster 
analysis or hierarchical clustering. The fundamental steps of this algorithm would be as follows: 


e We must begin by looking at the data points that have been given their own cluster. 

e The centroids are now computed, and the locations of new centroids are updated. 

e Repeating this procedure brings us closer to the apex of the cluster, or to the area of higher 
density. 

e The point where centroids stop moving is where this method terminates. 
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Let us employ this algorithm and train a model in the Python Programming Language. 
MEAN SHIFT ALGORITHM EXAMPLE 
We begin by importing all necessary packages into our Python script: 
import numpy as np 
import numpy as np 


from sklearn.cluster import MeanShift 
import matplotlib.pyplot as plt 


from matplotlib import style 


from sklearn.datasets import make_blobs 


We set an optional style for our matplotlib canvas: 


style.use("“ggplot") 


We generate dummy data: 


Centers: —si lee 
[4, 5], 
[3, 10]] 


make_blobs(n_samples = 500, 
centers = centers, 
cluster_std = 1) 


We may visualize the dummy data as follows: 


.-title("Mean Shift Algorithm Clusters") 
.xlabel("X-AXIS") 
-ylabel ("Y-AXIS") 


-scatter(X[:, @], X[:, 1]) 
. show() 


The output to the above block of code will show as follows: 
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Next, we may instantiate and train an object of the MeanShift class: 


algorithm = MeanShift() 


model = algorithm. fit(X) 


We may obtain the predicted clusters of our model, as well as the cluster centers: 


predictions = model.labels__ 


cluster_centers = model.cluster_centers_ 


We will be able to check the data points for the cluster centers of the model, as well as the suggested 
number of clusters that are detected: 


print(cluster_centers) 


print("Estimated Number Of Clusters:", n_clusters_) 


The output to the above lines of code will show as follows: 


[7 ‘Console 1/A | 4 


In [8]: runfile('C:/Users/Shivek/Desktop/BSc 
Degree Year 2/data preprocessing.py', 
wdir='C:/Users/Shivek/Desktop/BSc Degree Year 
| 

[[2.83363673 3.38385265] 


[3.03099751 9.99155061] ] 
Estimated Number Of Clusters: 2 


In [9]: 


IPython console History 


<2, LSP Python: ready @ conda (Python 3.8.8) Line 43, Col 1 Mem 72% 


It is good to note that the output we see in the above image and the graph below will differ each 
time you run your Python script. 


Finally, we may proceed to visualize our KMeans Model using MatPlotLib: 


i in range(len(X)): 
plt.plot(X[i][@], X[i][1], colors[predictions[i]], markersize = 10) 
plt.scatter(cluster_centers[:,0],cluster_centers[:,1], 


marker="x",color='k', s=15@, linewidths = 5, zorder=10) 


.title("Mean Shift Algorithm Clusters") 
.xlabel("X-AXIS") 

.ylabel ("Y-AXIS") 

. show() 


The visualization shows as follows: 
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